314
23
Regulatory Networks
neer biomolecular assemblies for a variety of applications, including the creation of
polymeric materials. 7
23.2 Network Modelling
The proteins are considered as the nodes of a graph (cf. Sect. 12.2), and a pair
of proteins is joined by a vertex if the proteins associate with each other. The
“interactome”—the set of interactions in which a protein participates—is character-
ized by the graph (or an equivalent list of all the proteins in a cell, each associated
with a sublist of the proteins with which they interact). This is in contrast to metabolic
networks, in which two metabolites are joined if there is a chemical reaction (catal-
ysed by an enzyme) leading from one to another (Sect. 23.15). Attention is often
focused on small portions of these networks, which are then called pathways.
All proteins are, of course, gene products. 8 Hence, the fundamental regulatory
network is that of the genes, 9 which constitute the nodes, the edges signifying the
activation or inhibition of other genes, and the central problem is to infer (“reverse
engineer”) both the state structure of the network (cf. Fig. 12.1) and the physical
network of interactions. For the former, the input data are now typically the temporal
evolution of gene expression profiles, obtained by a succession of microarray exper-
iments (cf. Sect. 18.1). For the latter, association is measured more or less directly
using a variety of physicochemical techniques.
The graph of interactions is potentially extraordinarily large and complex (and,
let us reiterate, ignores higher-order correlations). Even if one confines oneself to the
upper NN expressed proteins in a cell, there aretilde upper N squared∼N 2 potential binary interactions and vastly
more higher-order ones; 10 and even if only a small fraction of these interactions
actually occur (and some general results for the stability of systems (Sect. 12.1)
suggest that only about 10% will occur), we are still talking abouttilde 10 Superscript 7∼107 interactions,
assuming about 10 Superscript 4104 expressed proteins (in a eukaryotic cell), and 10 Superscript 8108 pairs would
have to be screened in order to find the 10%. In a prokaryote, with possibly only
1000 expressed proteins, the situation is more tractable but still poses a daunting
experimental challenge, even without considering that many of those proteins are
present in extremely low concentrations.
When one or more stimuli arrive at a cell, the affinities of certain proteins for
a transcription factor binding site (TFBS) are altered, and mRNA transcription is
activated or inhibited, resulting in altered abundance of the mRNA and the trans-
7 Wilson et al. (2018).
8 This statement, the obvious corollary of the central dogma, is actually quite problematical—in
the sense of having a rather ambiguous meaning—when scrutinized in detail. Many functionally
relevant proteins are significantly modified (e.g., glycosylated) by enzymes after translation. Of
course, the enzymes themselves are gene products.
9 Schlitt and Brazma (2005).
10 Many transcription factors, for example, are multiprotein complexes.